What is claimed is: 



1 . A method of identifying at least one node close to a first node in a network, the 
method comprising: 

5 selecting a set of candidate nodes from a plurality of nodes based on location 

information for the candidate nodes and the first node; 

applying a clustering algorithm to the location information for the candidate nodes 
and the first node; and 

identifying a subset of the set of candidate nodes closest to the first node based on 
10 results of applying the clustering algorithm. 

2. The method of claim 1, wherein selecting a set of candidate nodes comprises: 
comparing location information for the plurality of nodes to the location information 

for the first node to select the set of candidate nodes from the plurality of nodes closest to the 
15 first node. 

3. The method of claim 2, further comprising: 

receiving the location information for the first node at a node in a distributed hash 
table overlay network, the distributed hash table overlay network being a logical 
20 representation of the network including the first node and the plurality of nodes; and 

storing the location information for the first node at the node in the distributed hash 
table overlay network. 
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4. The method of claim 3, further comprising: 

the first node hashing the location information for the first node to identify a location 
in the distributed hash table overlay network to store the location information for the first 
node, wherein the node in the distributed hash table overlay network is approximately at the 
5 identified location. 

5. The method of claim 3, further comprising: 

receiving the location information for the plurality of nodes at the node in the 
distributed hash table overlay network; and 
10 storing the received location information for the plurality of nodes at the node in the 

distributed hash table overlay network. 

6. The method of claim 5, further comprising: 

retrieving the location information for the plurality of nodes and the first node from 
15 stored location information at the node in the distributed hash table overlay network; and 
comparing the retrieved location information to select the set of candidate nodes 
proximal ly located to the first node from the plurality of nodes. 

7. The method of claim 1, wherein the location information for the first node and the 
20 plurality of nodes comprises landmark vectors for each of the first node and the plurality of 

nodes, the landmark vectors for each of the first node and the plurality of nodes including 
distances measured to a plurality of global landmark nodes and at least one local landmark 
node proximally located to a respective one of the first node and the plurality of nodes. 
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8. The method of claim 7, wherein comparing location information for the plurality of 
nodes to the location information for the first node comprises: 

comparing global landmark vector portions of the landmark vectors for the first node 
and the plurality of nodes; and 
5 selecting candidate nodes from the plurality of nodes having landmark vectors with a 

predetermined similarity to the landmark vector for the first node. 

9. The method of claim 7, wherein the at least one local landmark node proximally 
located to a respective one of the first node and the plurality of nodes is one of on a routing 

10 path between the respective node and one of the plurality of global landmark nodes and 
within a predetermined distance to the respective node. 

10. The method of claim 1, further comprising: 

determining distances to each of the subset of candidate nodes from the first node; and 
1 5 selecting a closest node to the first node from the subset of candidate nodes based on 

the determined distances. 

1 1 . The method of claim 1 , further comprising: 

selecting a node from the subset of nodes based on at least one of distances to each of 
20 the subset of candidate nodes from the first node and quality of service characteristics 
associated with the subset of nodes. 
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12. The method of claim 1, wherein the clustering algorithm is an algorithm operable to 
identify similarities between the location information for the first node and the candidate 
nodes. 

5 13. The method of claim 12, wherein the clustering algorithm comprises at least one a 
minsum, maxdif f, order, inner product algorithm, k-means, principal component analysis, 
and latent semantic indexing. 

14. A node in a network comprising: 

10 means for selecting a set of candidate nodes from a plurality of nodes based on 

location information for the candidate nodes and a first node; 

means for applying a clustering algorithm to the location information for the candidate 
nodes and the first node; and 

means for identifying a subset of the set of candidate nodes closest to the first node 
15 based on the results of applying the clustering algorithm. 

15. The node of claim 14, further comprising: 

means for receiving the location information for the plurality of nodes and the first 
node; and 

20 means for storing the location information for the plurality of nodes and the first node. 
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16. The node of claim 15, further comprising: 

means for retrieving the location information for the plurality of nodes and the first 
node from the means for storing; and 

means for comparing the location information for the plurality of nodes and the first 
5 node to select the candidate nodes. 

1 7. The node of claim 1 4, further comprising means for transmitting a list of the subset of 
candidate nodes to the first node. 

10 18. A computer system operable to connect to a peer-to-peer network and operable to 
function as a distributed hash table node in a distributed hash table overlay network, the 
distributed hash table overlay network being a logical representation of the peer-to-peer 
network, wherein the computer system comprises: 

a memory operable to store location information for a plurality of nodes in the peer-to- 
15 peer network that are physically close in the network; and 

a processor operable to compare the location information for the plurality of nodes to 
location information for a first node to identify a set of nodes from the plurality of nodes that 
are physically close to the first node in the peer-to-peer network. 

20 19. The computer system of claim 18, wherein the processor is operable to apply a 
clustering algorithm to the location information for the set of nodes to identify a subset of 
nodes closest to the first node. 
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20. The computer system of claim 1 8, wherein the location information for the first node 
and the plurality of nodes comprises distances measured from each of the first node and the 
plurality of nodes to a plurality of global landmark nodes and to at least one local landmark 
node proximally located to a respective one of the first node and the plurality of nodes. 

5 

2 1 . Computer software embedded on a computer readable medium, the computer software 
comprising instructions performing: 

selecting a set of candidate nodes from a plurality of nodes based on location 
information for the candidate nodes and a first node; 
10 applying a clustering algorithm to the location information for the candidate nodes 

and the first node; and 

identifying a subset of the set of candidate nodes closest to the first node based on the 
results of applying the clustering algorithm. 

15 22. The computer software of claim 2 1 , wherein instructions performing selecting a set of 
candidate nodes comprises: 

comparing location information for the plurality of nodes to the location information 
for the first node to select the set of candidate nodes physically close to the first node. 

20 23. The computer software of claim 26, wherein the location information for the first node 
and the plurality of nodes comprises distances measured from each of the first node and the 
plurality of nodes to a plurality of global landmark nodes and to at least one local landmark 
node proximally located to a respective one of the first node and the plurality of nodes. 
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24. A method of storing information in a node in a network, wherein the node is operable 
to function as a distributed hash table node in a distributed hash table overlay network, the 
distributed hash table overlay network being a logical representation of the network, wherein 
the method comprises: 

receiving location information for a plurality of nodes, the nodes being located 
physically close in the network; and 

storing the location information in a table, wherein the location information for the 
plurality of nodes comprises distances measured from each of the plurality of nodes to a 
plurality of global landmark nodes and to at least one local landmark node. 

25. The method of claim 24, further wherein the at least one local landmark node is 
proximal ly located to a respective node of the plurality of nodes. 

26. The method of claim 24, further comprising: 

storing measured network metrics associated with at least one of the plurality of nodes 
in the table. 
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